Least squares estimators for discretely observed
stochastic processes driven by small Lévy noises

Hongwei Long^{a,*}, Yasutaka Shimizu^{b} and Wei Sun^{c}

^{a} Department of Mathematical Sciences, Florida Atlantic University
Boca Raton, Florida 33431-0991, USA
^{b} Graduate School of Engineering Science, Osaka University
Toyonaka, Osaka 560-8531, Japan
^{c} Department of Mathematics and Statistics, Concordia University
Montreal, Quebec H3G 1M8, Canada 

May 18, 2012 



Abstract 

We study the problem of parameter estimation for discretely observed stochastic
processes driven by additive small Lévy noises. We do not impose any moment
condition on the driving Lévy process. Under certain regularity conditions on the
drift function, we obtain consistency and rate of convergence of the least squares
estimator (LSE) of the drift parameter when a small dispersion coefficient $\varepsilon \to 0$
and $n \to \infty$ simultaneously. The asymptotic distribution of the LSE in our general
setting is shown to be the convolution of a normal distribution and a distribution
related to the jump part of the Lévy process.

1 Introduction

Let $(\Omega, \mathcal{F}, P)$ be a basic probability space equipped with a right continuous and increasing
family of $\sigma$-algebras $(\mathcal{F}_t, t \ge 0)$. Let $(L_t, t \ge 0)$ be an $\mathbb{R}^d$-valued Lévy process, which is
given by

$$L_t = at + \sigma B_t + \int_0^t\!\int_{|z|\le 1} z\,\tilde N(ds,dz) + \int_0^t\!\int_{|z|>1} z\,N(ds,dz), \tag{1.1}$$

where $a = (a_1, \dots, a_d) \in \mathbb{R}^d$, $\sigma = (\sigma_{ij})_{d\times r}$ is a $d\times r$ real-valued matrix, $B_t = (B_t^1, \dots, B_t^r)$ is
an $r$-dimensional standard Brownian motion, $N(ds,dz)$ is an independent Poisson random
measure on $\mathbb{R}_+ \times (\mathbb{R}^d\setminus\{0\})$ with characteristic measure $dt\,\nu(dz)$, and
$\tilde N(ds,dz) = N(ds,dz) - \nu(dz)\,ds$ is the corresponding compensated measure. Here we assume that $\nu(dz)$
is a Lévy measure on $\mathbb{R}^d\setminus\{0\}$ satisfying $\int_{\mathbb{R}^d\setminus\{0\}} (|z|^2 \wedge 1)\,\nu(dz) < \infty$ with $|z| = \sqrt{\sum_{i=1}^d z_i^2}$.
The stochastic process $X = (X_t, t \ge 0)$, starting from $x_0 \in \mathbb{R}^d$, is defined as the unique
strong solution to the following stochastic differential equation (SDE)

$$dX_t = b(X_t, \theta)\,dt + \varepsilon\,dL_t, \quad t \in [0,1]; \qquad X_0 = x_0, \tag{1.2}$$

where $\theta \in \Theta = \bar\Theta_0$ (the closure of $\Theta_0$) with $\Theta_0$ being an open bounded convex subset of
$\mathbb{R}^p$, and $b = (b_1, \dots, b_d) : \mathbb{R}^d \times \Theta \to \mathbb{R}^d$ is a known function. Without loss of generality,
we assume that $\varepsilon \in (0, 1]$. The regularity conditions on $b$ will be provided in Section 2.
Assume that this process is observed at regularly spaced time points $\{t_k = k/n,\ k = 1, 2, \dots, n\}$.
The only unknown quantity in SDE (1.2) is the parameter $\theta$. Let $\theta_0 \in \Theta_0$ be
the true value of the parameter $\theta$. The purpose of this paper is to study the least squares
estimator for the true value $\theta_0$ based on the sampling data $(X_{t_k})_{k=1}^n$ with small dispersion
$\varepsilon$ and large sample size $n$.
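
To make the observation scheme concrete, the following is a minimal simulation sketch, not part of the original paper. It assumes a one-dimensional state, an Euler scheme on the observation grid $t_k = k/n$, and a driving noise made of a Brownian part plus a compound Poisson part with Gaussian jumps. The function name `simulate_path` and the parameters `sigma`, `jump_rate` and `jump_scale` are illustrative choices; the paper itself allows a general Lévy process.

```python
import math
import random


def simulate_path(b, theta, x0, eps, n, rng, sigma=1.0, jump_rate=2.0, jump_scale=1.0):
    """Euler scheme for dX = b(X, theta) dt + eps dL on [0, 1] with step 1/n.

    Assumption (illustrative only): L = sigma * B + compound Poisson process
    with rate `jump_rate` and N(0, jump_scale^2) jump sizes.
    Returns the list [X_0, X_{1/n}, ..., X_1].
    """
    dt = 1.0 / n
    xs = [x0]
    for _ in range(n):
        # Brownian increment over one step
        d_brownian = rng.gauss(0.0, math.sqrt(dt))
        # Jumps arriving in one step: count exponential inter-arrival times
        jump_sum = 0.0
        arrival = rng.expovariate(jump_rate)
        while arrival < dt:
            jump_sum += rng.gauss(0.0, jump_scale)
            arrival += rng.expovariate(jump_rate)
        d_levy = sigma * d_brownian + jump_sum
        xs.append(xs[-1] + b(xs[-1], theta) * dt + eps * d_levy)
    return xs


if __name__ == "__main__":
    rng = random.Random(1)
    path = simulate_path(lambda x, th: -th * x, 1.0, 1.0, 0.05, 100, rng)
    print(len(path), path[-1])
```

With `eps = 0` the scheme reduces to the Euler approximation of the deterministic ODE, which is the limiting object used throughout the paper.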

In the case of diffusion processes driven by Brownian motion, a popular method is the
maximum likelihood estimator (MLE) based on the Girsanov density when the processes
can be observed continuously (see Prakasa Rao [22], Liptser and Shiryaev [17], Kutoyants
[14]). When a diffusion process is observed only at discrete times, in most cases the
transition density and hence the likelihood function of the observations is not explicitly
computable. In order to overcome this difficulty, some approximate likelihood methods
have been proposed by Lo [18], Pedersen [25]-[22], Poulsen [27], and Aït-Sahalia [1]. For a
comprehensive review on MLE and other related methods, we refer to Sørensen [M]. The
least squares estimator (LSE) is asymptotically equivalent to the MLE. For the LSE, the
convergence in probability was proved in Dorogovcev [3] and Le Breton [16], the strong
consistency was studied in Kasonga [10], and the asymptotic distribution was studied in
Prakasa Rao [SS]. For a more recent comprehensive discussion, we refer to Prakasa Rao
[29], Kutoyants [13] and the references therein.

The parametric estimation problems for diffusion processes with jumps based on discrete
observations have been studied by Shimizu and Yoshida [32] and Shimizu [30] via
the quasi-maximum likelihood. They established consistency and asymptotic normality
for the proposed estimators. Moreover, Ogihara and Yoshida [23] showed some stronger
results than the ones by Shimizu and Yoshida [32], and also investigated an adaptive
Bayes-type estimator with its asymptotic properties. The driving jump processes considered
in Shimizu and Yoshida [32], Shimizu [30] and Ogihara and Yoshida [23] include
a large class of Lévy processes such as compound Poisson processes, gamma, inverse
Gaussian, variance gamma, normal inverse Gaussian or some generalized tempered stable
processes. Masuda [22] dealt with the consistency and asymptotic normality of the
TFE (trajectory-fitting estimator) and LSE when the driving process is a zero-mean
adapted process (including Lévy processes) with finite moments. The parametric estimation
for Lévy-driven Ornstein-Uhlenbeck processes was also studied by Brockwell et al.
[2], Spiliopoulos [36], and Valdivieso et al. [43]. However, the aforementioned papers
were unable to cover an important class of driving Lévy processes, namely $\alpha$-stable Lévy
motions with $\alpha \in (0, 2)$. Recently, Hu and Long [7], [8] have started the study of parameter
estimation for Ornstein-Uhlenbeck processes driven by $\alpha$-stable Lévy motions. They
obtained some new asymptotic results on the proposed TFE and LSE under continuous
or discrete observations, which are different from the classical cases where asymptotic
distributions are normal. Fasen [1] extended the results of Hu and Long [8] to multivariate
Ornstein-Uhlenbeck processes driven by $\alpha$-stable Lévy motions. Masuda [23] proposed a
self-weighted least absolute deviation estimator for discretely observed ergodic
Ornstein-Uhlenbeck processes driven by symmetric Lévy processes.

The asymptotic theory of parametric estimation for diffusion processes with small
white noise based on continuous-time observations has been well developed (see, e.g.,
Kutoyants [12], [13], Yoshida [l5l H?], Uchida and Yoshida [H]). There have been many
applications of small noise asymptotics to mathematical finance; see for example Yoshida
[15], Takahashi [37], Kunitomo and Takahashi [11], Takahashi and Yoshida [38], Uchida
and Yoshida [12]. From a practical point of view in parametric inference, it is more realistic
and interesting to consider asymptotic estimation for diffusion processes with small noise
based on discrete observations. Substantial progress has been made in this direction.
Genon-Catalot [5] and Laredo [15] studied the efficient estimation of drift parameters
of small diffusions from discrete observations when $\varepsilon \to 0$ and $n \to \infty$. Sørensen [33]
used martingale estimating functions to establish consistency and asymptotic normality
of the estimators of drift and diffusion coefficient parameters when $\varepsilon \to 0$ and $n$ is fixed.
Sørensen and Uchida [35] and Gloter and Sørensen [6] used a contrast function to study
the efficient estimation for unknown parameters in both drift and diffusion coefficient
functions. Uchida [311 SO] used the martingale estimating function approach to study
estimation of drift parameters for small diffusions under weaker conditions. Thus, in the
cases of small diffusions, the asymptotic distributions of the estimators are normal under
suitable conditions on $\varepsilon$ and $n$.

Long [19] studied the parameter estimation problem for discretely observed one-dimensional
Ornstein-Uhlenbeck processes with small Lévy noises. In that paper, the
drift function is linear in both $x$ and $\theta$ ($b(x,\theta) = -\theta x$), and the driving Lévy process is
$L_t = aB_t + bZ_t$, where $a$ and $b$ are known constants, $(B_t, t \ge 0)$ is the standard Brownian
motion and $Z_t$ is an $\alpha$-stable Lévy motion independent of $(B_t, t \ge 0)$. The consistency
and rate of convergence of the least squares estimator are established. The asymptotic
distribution of the LSE is shown to be the convolution of a normal distribution and a
stable distribution. In a similar framework, Long [20] discussed the statistical estimation
of the drift parameter for a class of SDEs with special drift function $b(x,\theta) = \theta b(x)$. Ma
[21] extended the results of Long [19] to the case when the driving noise is a general Lévy
process. However, all the drift functions discussed in Long [19], [20] and Ma [21] are linear
in $\theta$, which restricts the applicability of their models and results. In this paper, we allow
the drift function $b(x,\theta)$ to be nonlinear in both $x$ and $\theta$, and the driving noise to be a
general Lévy process. We are interested in estimating the drift parameter in SDE (1.2)
based on discrete observations $\{X_{t_k}\}_{k=1}^n$ when $\varepsilon \to 0$ and $n \to \infty$. We shall use the least
squares method, based on the contrast function

$$\Psi_{n,\varepsilon}(\theta) = \sum_{k=1}^n \frac{|X_{t_k} - X_{t_{k-1}} - b(X_{t_{k-1}},\theta)\,\Delta t_{k-1}|^2}{\varepsilon^2\,\Delta t_{k-1}},$$

where $\Delta t_{k-1} = t_k - t_{k-1} = 1/n$. Then the LSE $\hat\theta_{n,\varepsilon}$ is defined as

$$\hat\theta_{n,\varepsilon} := \arg\min_{\theta\in\Theta} \Psi_{n,\varepsilon}(\theta).$$

Since minimizing $\Psi_{n,\varepsilon}(\theta)$ is equivalent to minimizing

$$\Phi_{n,\varepsilon}(\theta) := \varepsilon^2\big(\Psi_{n,\varepsilon}(\theta) - \Psi_{n,\varepsilon}(\theta_0)\big),$$

we may write the LSE as

$$\hat\theta_{n,\varepsilon} = \arg\min_{\theta\in\Theta} \Phi_{n,\varepsilon}(\theta).$$



We shall use this fact later for convenience of the proofs. 
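
As an illustration of how such an estimator can be computed in practice, here is a minimal sketch, not taken from the paper. It assumes a scalar state and a scalar parameter, and it minimizes the sum of squared one-step residuals $X_{t_k} - X_{t_{k-1}} - b(X_{t_{k-1}},\theta)/n$, which is a positive multiple of the least squares contrast and therefore has the same minimizer. The golden-section search and the names `contrast`, `golden_section_min` and `lse_scalar` are illustrative choices, and the search presumes the contrast is unimodal on the chosen interval.

```python
import math


def contrast(theta, xs, b):
    """Sum of squared one-step residuals for observations xs at times k/n."""
    n = len(xs) - 1
    return sum((xs[k + 1] - xs[k] - b(xs[k], theta) / n) ** 2 for k in range(n))


def golden_section_min(f, lo, hi, tol=1e-10):
    """Minimize a unimodal function f on [lo, hi] by golden-section search."""
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    c = hi - inv_phi * (hi - lo)
    d = lo + inv_phi * (hi - lo)
    fc, fd = f(c), f(d)
    while hi - lo > tol:
        if fc < fd:
            # minimum lies in [lo, d]
            hi, d, fd = d, c, fc
            c = hi - inv_phi * (hi - lo)
            fc = f(c)
        else:
            # minimum lies in [c, hi]
            lo, c, fc = c, d, fd
            d = lo + inv_phi * (hi - lo)
            fd = f(d)
    return 0.5 * (lo + hi)


def lse_scalar(xs, b, lo, hi):
    """Least squares estimate of a scalar drift parameter on [lo, hi]."""
    return golden_section_min(lambda th: contrast(th, xs, b), lo, hi)


if __name__ == "__main__":
    drift = lambda x, th: -th * x
    n = 200
    xs = [1.0]
    for _ in range(n):  # noise-free data generated with theta = 1.5
        xs.append(xs[-1] + drift(xs[-1], 1.5) / n)
    print(lse_scalar(xs, drift, 0.0, 5.0))
```

For a vector parameter one would replace the one-dimensional search by a general-purpose optimizer; the contrast itself is unchanged.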

In the nonlinear case, it is generally very difficult or impossible to obtain an explicit
formula for the least squares estimator $\hat\theta_{n,\varepsilon}$. However, we can use some nice criteria in
statistical inference (see Chapter 5 of van der Vaart [H] and Shimizu [31] for a more general
criterion) to establish the consistency of the LSE as well as its asymptotic behavior
(asymptotic distribution and rate of convergence). In this paper, we consider the asymptotics
of the LSE $\hat\theta_{n,\varepsilon}$ with high frequency ($n \to \infty$) and small dispersion ($\varepsilon \to 0$). Our
goal is to prove that $\hat\theta_{n,\varepsilon} \to \theta_0$ in probability and to establish its rate of convergence and
asymptotic distribution. We obtain some new asymptotic distributions for the LSE in
our general setting, which are the convolutions of a normal distribution and a distribution
related to the jump part of the driving Lévy process.

The paper is organized as follows. In Section 2, we state our main results with some
remarks and examples. We establish the consistency of the LSE $\hat\theta_{n,\varepsilon}$ and give its asymptotic
distribution, which is a natural extension of the classical small-diffusion cases. All
the proofs are given in Section 3.

2 Main results 

2.1 Notation and assumptions 

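Let $X^0 = (X_t^0, t \ge 0)$ be the solution to the underlying ordinary differential equation
(ODE) under the true value of the drift parameter:

$$dX_t^0 = b(X_t^0, \theta_0)\,dt, \qquad X_0^0 = x_0.$$

For a multi-index $m = (m_1, \dots, m_k)$, we define a derivative operator in $z \in \mathbb{R}^k$ as
$\partial_z^m := \partial_{z_1}^{m_1}\cdots\partial_{z_k}^{m_k}$, where $\partial_{z_i}^{m_i} := \partial^{m_i}/\partial z_i^{m_i}$. Let $C^{k,l}(\mathbb{R}^d\times\Theta;\mathbb{R}^q)$ be the space of all functions
$f : \mathbb{R}^d \times \Theta \to \mathbb{R}^q$ which are $k$ and $l$ times continuously differentiable with respect to $x$
and $\theta$, respectively. Moreover, $C_\uparrow^{k,l}(\mathbb{R}^d\times\Theta;\mathbb{R}^q)$ is the class of $f \in C^{k,l}(\mathbb{R}^d\times\Theta;\mathbb{R}^q)$ satisfying
$\sup_{\theta\in\Theta} |\partial_\theta^\alpha \partial_x^\beta f(x,\theta)| \le C(1+|x|)^\lambda$ for universal positive constants $C$ and $\lambda$, where
$\alpha = (\alpha_1, \dots, \alpha_p)$ and $\beta = (\beta_1, \dots, \beta_d)$ are multi-indices with $0 \le \sum_{i=1}^p \alpha_i \le l$ and
$0 \le \sum_{i=1}^d \beta_i \le k$, respectively.

We introduce the following set of assumptions.

(A1) There exists a constant $K > 0$ such that

$$|b(x,\theta) - b(y,\theta)| \le K|x-y|; \qquad |b(x,\theta)| \le K(1+|x|)$$

for each $x, y \in \mathbb{R}^d$ and $\theta \in \Theta$.
(A2) $b(\cdot,\cdot) \in C_\uparrow^{2,3}(\mathbb{R}^d\times\Theta;\mathbb{R}^d)$.
(A3) $\theta \ne \theta_0 \Leftrightarrow b(X_t^0,\theta) \ne b(X_t^0,\theta_0)$ for at least one value of $t \in [0,1]$.
(A4) $I(\theta_0) = (I^{ij}(\theta_0))_{1\le i,j\le p}$ is positive definite, where

$$I^{ij}(\theta) = \int_0^1 (\partial_{\theta_i} b)^T(X_s^0,\theta)\,\partial_{\theta_j} b(X_s^0,\theta)\,ds.$$

It is well-known that SDE (1.2) has a unique strong solution under (A1). For convenience,
we shall use $C$ to denote a generic constant whose value may vary from place to
place. For a matrix $A$, we define $|A|^2 = \mathrm{tr}(AA^T)$, where $A^T$ is the transpose of $A$. In
particular, $|\sigma|^2 = \sum_{i=1}^d\sum_{j=1}^r \sigma_{ij}^2$.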
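
The matrix in (A4) can be approximated numerically once the ODE solution is available. The following sketch is an illustration under stated assumptions, not the paper's procedure: it takes a scalar state, assumes that $I^{ij}(\theta_0)$ is the integral over $[0,1]$ of the product of the $\theta_i$- and $\theta_j$-derivatives of $b$ along $X^0$ (as the examples in Section 2.2 suggest), and uses a midpoint rule. The names `fisher_information`, `grad_b` and `x_path` are hypothetical.

```python
def fisher_information(grad_b, x_path, theta0, m=1000):
    """Midpoint-rule approximation of I(theta0) for a scalar state.

    grad_b(x, theta) returns the tuple of partial derivatives of b in theta;
    x_path(s) returns the ODE solution X^0_s for s in [0, 1].
    """
    p = len(grad_b(x_path(0.0), theta0))
    info = [[0.0] * p for _ in range(p)]
    for k in range(m):
        s = (k + 0.5) / m  # midpoint of the k-th subinterval
        g = grad_b(x_path(s), theta0)
        for i in range(p):
            for j in range(p):
                info[i][j] += g[i] * g[j] / m
    return info


if __name__ == "__main__":
    # b(x, theta) = theta_1 + theta_2 * x with theta0 = (1, 0), x0 = 0, so X^0_s = s
    info = fisher_information(lambda x, th: (1.0, x), lambda s: s, (1.0, 0.0))
    print(info)
```

In this toy case the exact matrix has entries $1$, $1/2$, $1/2$ and $1/3$, and its determinant $1/12$ is positive, so (A4) holds.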



2.2 Asymptotic behavior of LSE 

The consistency of our estimator $\hat\theta_{n,\varepsilon}$ is given as follows.

Theorem 2.1 Under conditions (A1)-(A3), we have

$$\hat\theta_{n,\varepsilon} \xrightarrow{P_{\theta_0}} \theta_0$$

as $\varepsilon \to 0$ and $n \to \infty$.

The next theorem gives the asymptotic distribution of $\hat\theta_{n,\varepsilon}$. As is easily seen, our result
includes the case of Sørensen and Uchida [35] as a special case.

Theorem 2.2 Under conditions (A1)-(A4), we have

$$\varepsilon^{-1}(\hat\theta_{n,\varepsilon} - \theta_0) \xrightarrow{P_{\theta_0}} I^{-1}(\theta_0)\,S(\theta_0) \tag{2.1}$$

as $\varepsilon \to 0$, $n \to \infty$ and $n\varepsilon \to \infty$, where

$$S(\theta_0) := \Big(\int_0^1 (\partial_{\theta_1} b)^T(X_s^0,\theta_0)\,dL_s, \dots, \int_0^1 (\partial_{\theta_p} b)^T(X_s^0,\theta_0)\,dL_s\Big)^T.$$

Remark 2.3 One of our main contributions is that we no longer require any high-order
moment condition on $X$ as in, e.g., Sørensen and Uchida [35] and others, which makes
our results applicable in many practical models.

Remark 2.4 In general, the limiting distribution on the right-hand side of (2.1) is a
convolution of a normal distribution and a distribution related to the jump part of the
Lévy process. In particular, if the driving Lévy process $L$ is a linear combination of
standard Brownian motion and $\alpha$-stable motion, the limiting distribution becomes the
convolution of a normal distribution and a stable distribution.

Remark 2.5 When $d = 1$ and $b(x,\theta) = -\theta x$, i.e., SDE (1.2) is linear and driven by a
general Lévy process, Theorem 2.2 reduces to Theorem 1.1 of Ma [21]. When the driving
Lévy process is a linear combination of standard Brownian motion and $\alpha$-stable motion,
Theorem 2.2 was discussed in Long [19] and Ma [21].

Remark 2.6 Our results and arguments in the paper can be extended to SDEs driven
by small semi-martingale noises.

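Example 2.7 We consider a one-dimensional stochastic process in (1.2) with drift function
$b(x,\theta) = \theta_1 + \theta_2 x$. We assume that the true value $\theta_0 = (\theta_1^0, \theta_2^0)^T$ of $\theta = (\theta_1, \theta_2)^T$ belongs
to $\Theta_0 = (c_1, c_2) \times (c_3, c_4) \subset \mathbb{R}^2$ with $c_1 < c_2$ and $c_3 < c_4$. Then, $X^0$ satisfies the following
ODE

$$dX_t^0 = (\theta_1^0 + \theta_2^0 X_t^0)\,dt, \qquad X_0^0 = x_0.$$

The explicit solution is given by $X_t^0 = e^{\theta_2^0 t}x_0 + \theta_1^0(e^{\theta_2^0 t} - 1)/\theta_2^0$ when $\theta_2^0 \ne 0$; $X_t^0 = x_0 + \theta_1^0 t$ when
$\theta_2^0 = 0$. Writing $Y_k = X_{t_k} - X_{t_{k-1}}$, the LSE $\hat\theta_{n,\varepsilon} = (\hat\theta_{n,\varepsilon,1}, \hat\theta_{n,\varepsilon,2})^T$ of $\theta_0$ solves the normal equations of the least squares problem and is given by

$$\hat\theta_{n,\varepsilon,2} = \frac{n\big(n\sum_{k=1}^n X_{t_{k-1}}Y_k - \sum_{k=1}^n X_{t_{k-1}}\sum_{k=1}^n Y_k\big)}{n\sum_{k=1}^n X_{t_{k-1}}^2 - \big(\sum_{k=1}^n X_{t_{k-1}}\big)^2}, \qquad \hat\theta_{n,\varepsilon,1} = \sum_{k=1}^n Y_k - \frac{\hat\theta_{n,\varepsilon,2}}{n}\sum_{k=1}^n X_{t_{k-1}}.$$

Note that $\partial_{\theta_1} b(x,\theta) = 1$ and $\partial_{\theta_2} b(x,\theta) = x$. In this case, the limiting random vector in
Theorem 2.2 is $I^{-1}(\theta_0)\big(\int_0^1 dL_s, \int_0^1 X_s^0\,dL_s\big)^T$, where

$$I(\theta_0) = \begin{pmatrix} 1 & \int_0^1 X_s^0\,ds \\ \int_0^1 X_s^0\,ds & \int_0^1 (X_s^0)^2\,ds \end{pmatrix}.$$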
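
The closed-form estimator for a linear drift is ordinary least squares of the scaled increments $n(X_{t_k} - X_{t_{k-1}})$ on $(1, X_{t_{k-1}})$. The sketch below is an illustration rather than the authors' code; it assumes observations $X_0, X_{1/n}, \dots, X_1$ stored in a list, and the name `lse_linear` is hypothetical.

```python
def lse_linear(xs):
    """Least squares estimate of (theta_1, theta_2) for b(x, theta) = theta_1 + theta_2 * x.

    xs holds the observations at times 0, 1/n, ..., 1.
    """
    n = len(xs) - 1
    prev = xs[:-1]
    incr = [xs[k + 1] - xs[k] for k in range(n)]
    sx = sum(prev)
    sy = sum(incr)
    sxx = sum(x * x for x in prev)
    sxy = sum(x * y for x, y in zip(prev, incr))
    # Normal equations of min sum_k (Y_k - (theta_1 + theta_2 * X_{k-1}) / n)^2
    theta2 = n * (n * sxy - sx * sy) / (n * sxx - sx * sx)
    theta1 = sy - theta2 * sx / n
    return theta1, theta2


if __name__ == "__main__":
    n = 100
    xs = [1.0]
    for _ in range(n):  # noise-free Euler data with theta = (1, -2)
        xs.append(xs[-1] + (1.0 - 2.0 * xs[-1]) / n)
    print(lse_linear(xs))
```

On noise-free Euler data the estimator recovers the generating parameters up to rounding, which is a convenient sanity check before adding noise.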



Example 2.8 We consider a one-dimensional stochastic process in (1.2) with drift function
$b(x,\theta) = \sqrt{\theta + x^2}$. We assume that the true value $\theta_0$ of $\theta$ belongs to $\Theta_0 = (c_1, c_2) \subset \mathbb{R}$
with $0 < c_1 < c_2 < \infty$. Then, $X^0$ satisfies the following ODE

$$dX_t^0 = \sqrt{\theta_0 + (X_t^0)^2}\,dt, \qquad X_0^0 = x_0.$$

The explicit solution is given by

$$X_t^0 = \frac{\big(x_0 + \sqrt{\theta_0 + x_0^2}\big)^2 e^{2t} - \theta_0}{2\big(x_0 + \sqrt{\theta_0 + x_0^2}\big)e^{t}}.$$

It is easy to verify that the LSE $\hat\theta_{n,\varepsilon}$ of $\theta$ is a solution to the following nonlinear equation

$$\sum_{k=1}^n \frac{X_{t_k} - X_{t_{k-1}}}{\sqrt{\theta + X_{t_{k-1}}^2}} = 1.$$

Since it is impossible to get an explicit expression for $\hat\theta_{n,\varepsilon}$, we solve the above equation
numerically (e.g. by using Newton's method). Note that $\partial_\theta b(x,\theta) = \dfrac{1}{2\sqrt{\theta + x^2}}$. It is clear that
the limiting random variable in Theorem 2.2 is

$$I^{-1}(\theta_0)\int_0^1 \frac{1}{2\sqrt{\theta_0 + (X_s^0)^2}}\,dL_s, \qquad \text{where } I(\theta_0) = \int_0^1 \frac{1}{4\big(\theta_0 + (X_s^0)^2\big)}\,ds.$$

In particular, we assume that $L_t = aB_t + \sigma Z_t$, where $B_t$ is the standard
Brownian motion and $Z_t$ is a standard $\alpha$-stable Lévy motion independent of $B_t$. Let us
denote by $N$ a random variable with the standard normal distribution and $U$ a random
variable with the standard $\alpha$-stable distribution $S_\alpha(1, \beta, 0)$, where $\alpha \in (0, 2)$ is the index
of stability and $\beta \in [-1, 1]$ is the skewness parameter. By using the self-similarity and
time change, we can easily show that the limiting random variable in Theorem 2.2 has the
same distribution as

$$a\,I^{-1/2}(\theta_0)\,N + \sigma\,I^{-1}(\theta_0)\left[\int_0^1 \Big(\frac{1}{2\sqrt{\theta_0 + (X_s^0)^2}}\Big)^{\alpha}\,ds\right]^{1/\alpha} U.$$
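
The estimating equation of this example can be solved with any scalar root finder. The text mentions Newton's method; the sketch below instead uses plain bisection, swapped in here because it needs no derivative and no starting guess, only a bracket on which the estimating function changes sign. It is an illustration under these assumptions, and the names `estimating_function` and `lse_bisection` are hypothetical.

```python
import math


def estimating_function(theta, xs):
    """sum_k (X_{t_k} - X_{t_{k-1}}) / sqrt(theta + X_{t_{k-1}}^2) - 1."""
    return sum((xs[k + 1] - xs[k]) / math.sqrt(theta + xs[k] ** 2)
               for k in range(len(xs) - 1)) - 1.0


def lse_bisection(xs, lo, hi, iters=80):
    """Root of the estimating function on [lo, hi] by bisection (lo > 0)."""
    g_lo = estimating_function(lo, xs)
    g_hi = estimating_function(hi, xs)
    if g_lo * g_hi > 0:
        raise ValueError("estimating function does not change sign on [lo, hi]")
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        g_mid = estimating_function(mid, xs)
        if g_lo * g_mid <= 0:
            hi = mid  # root lies in [lo, mid]
        else:
            lo, g_lo = mid, g_mid  # root lies in [mid, hi]
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    n = 500
    xs = [1.0]
    for _ in range(n):  # noise-free Euler data with theta_0 = 2
        xs.append(xs[-1] + math.sqrt(2.0 + xs[-1] ** 2) / n)
    print(lse_bisection(xs, 0.1, 10.0))
```

When all increments are positive, the estimating function is strictly decreasing in $\theta$, so the root in the bracket is unique.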



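Example 2.9 We consider a two-dimensional stochastic process in (1.2) with drift function
$b(x,\theta) = C + Ax$, where $C = (c_1, c_2)^T$, $A = (A_{ij})_{1\le i,j\le 2}$ and $x = (x_1, x_2)^T$. We assume
that the eigenvalues of $A$ have positive real parts. We want to estimate
$\theta = (\theta_1, \dots, \theta_6)^T = (c_1, A_{11}, A_{12}, c_2, A_{21}, A_{22})^T \in \Theta \subset \mathbb{R}^6$, whose true value is
$\theta_0 = (c_1^0, A_{11}^0, A_{12}^0, c_2^0, A_{21}^0, A_{22}^0)^T$.
Then $X^0$ satisfies the following ODE

$$dX_t^0 = (C_0 + A_0 X_t^0)\,dt, \qquad X_0^0 = x_0.$$

The explicit solution is given by $X_t^0 = e^{A_0 t}x_0 + \int_0^t e^{A_0(t-s)}C_0\,ds$. After some basic calculation
(the normal equations of the least squares problem), we find that the LSE $\hat\theta_{n,\varepsilon} = (\hat\theta_{n,\varepsilon,i})_{1\le i\le 6}$ is given by

$$\begin{pmatrix}\hat\theta_{n,\varepsilon,1}\\ \hat\theta_{n,\varepsilon,2}\\ \hat\theta_{n,\varepsilon,3}\end{pmatrix} = \Lambda_n^{-1}\begin{pmatrix}\sum_{k=1}^n Y_k^{(1)}\\ \sum_{k=1}^n X_{t_{k-1}}^{(1)}Y_k^{(1)}\\ \sum_{k=1}^n X_{t_{k-1}}^{(2)}Y_k^{(1)}\end{pmatrix}
\quad\text{and}\quad
\begin{pmatrix}\hat\theta_{n,\varepsilon,4}\\ \hat\theta_{n,\varepsilon,5}\\ \hat\theta_{n,\varepsilon,6}\end{pmatrix} = \Lambda_n^{-1}\begin{pmatrix}\sum_{k=1}^n Y_k^{(2)}\\ \sum_{k=1}^n X_{t_{k-1}}^{(1)}Y_k^{(2)}\\ \sum_{k=1}^n X_{t_{k-1}}^{(2)}Y_k^{(2)}\end{pmatrix},$$

where $X_{t_{k-1}}^{(i)}$ ($i = 1, 2$) are the components of $X_{t_{k-1}}$, $Y_k^{(i)}$ ($i = 1, 2$) are the components of
$Y_k = X_{t_k} - X_{t_{k-1}}$, and

$$\Lambda_n = \frac{1}{n}\begin{pmatrix} n & \sum_{k=1}^n X_{t_{k-1}}^{(1)} & \sum_{k=1}^n X_{t_{k-1}}^{(2)}\\ \sum_{k=1}^n X_{t_{k-1}}^{(1)} & \sum_{k=1}^n \big(X_{t_{k-1}}^{(1)}\big)^2 & \sum_{k=1}^n X_{t_{k-1}}^{(1)}X_{t_{k-1}}^{(2)}\\ \sum_{k=1}^n X_{t_{k-1}}^{(2)} & \sum_{k=1}^n X_{t_{k-1}}^{(1)}X_{t_{k-1}}^{(2)} & \sum_{k=1}^n \big(X_{t_{k-1}}^{(2)}\big)^2 \end{pmatrix}.$$

Since it is easy and straightforward to compute the partial derivatives $\partial_{\theta_i} b(x,\theta)$, $1 \le i \le 6$,
and the limiting random vector in Theorem 2.2, we omit the details here.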
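
For the two-dimensional affine drift, the least squares problem separates into two three-parameter regressions that share the same Gram matrix. The following sketch illustrates this under the assumption that the estimator is obtained from the normal equations of the sum of squared one-step residuals; it is not the authors' code, and the helper names `solve_linear` and `lse_affine_2d` are hypothetical.

```python
def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    m = len(b)
    aug = [list(a[i]) + [b[i]] for i in range(m)]
    for col in range(m):
        piv = max(range(col, m), key=lambda r: abs(aug[r][col]))
        aug[col], aug[piv] = aug[piv], aug[col]
        for r in range(col + 1, m):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, m + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * m
    for i in reversed(range(m)):
        s = aug[i][m] - sum(aug[i][j] * x[j] for j in range(i + 1, m))
        x[i] = s / aug[i][i]
    return x


def lse_affine_2d(path):
    """LSE of (c1, A11, A12, c2, A21, A22) for b(x) = C + A x.

    path is a list of (x1, x2) observations at times 0, 1/n, ..., 1.
    """
    n = len(path) - 1
    z = [(1.0, p[0], p[1]) for p in path[:-1]]  # regressors (1, X1, X2)
    gram = [[sum(zk[i] * zk[j] for zk in z) / n for j in range(3)] for i in range(3)]
    est = []
    for comp in range(2):
        incr = [path[k + 1][comp] - path[k][comp] for k in range(n)]
        rhs = [sum(zk[i] * yk for zk, yk in zip(z, incr)) for i in range(3)]
        est.extend(solve_linear(gram, rhs))
    return est


if __name__ == "__main__":
    n = 200
    c, a = (1.0, 0.0), ((1.0, 0.5), (0.2, 2.0))
    path = [(1.0, 1.0)]
    for _ in range(n):  # noise-free Euler data
        x1, x2 = path[-1]
        path.append((x1 + (c[0] + a[0][0] * x1 + a[0][1] * x2) / n,
                     x2 + (c[1] + a[1][0] * x1 + a[1][1] * x2) / n))
    print(lse_affine_2d(path))
```

The Gram matrix must be invertible, which fails if the observed trajectory lies on a straight line; the example above starts away from the eigen-directions of the matrix to avoid that degenerate case.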

3 Proofs 

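3.1 Proof of Theorem 2.1

We first establish some preliminary lemmas. In the sequel, we shall use the notation

$$Y_t^{n,\varepsilon} := X_{[nt]/n}$$

for the stochastic process $X$ defined by (1.2), where $[nt]$ denotes the integer part of $nt$.

Lemma 3.1 The sequence $\{Y_t^{n,\varepsilon}\}$ converges to the deterministic process $\{X_t^0\}$ uniformly
on compacts in probability as $\varepsilon \to 0$ and $n \to \infty$.

Proof. Note that

$$X_t - X_t^0 = \int_0^t \big(b(X_s,\theta_0) - b(X_s^0,\theta_0)\big)\,ds + \varepsilon L_t. \tag{3.1}$$

By the Lipschitz condition on $b(\cdot)$ in (A1) and the Cauchy-Schwarz inequality, we find
that

$$\begin{aligned}
|X_t - X_t^0|^2 &\le 2\Big|\int_0^t \big(b(X_s,\theta_0) - b(X_s^0,\theta_0)\big)\,ds\Big|^2 + 2\varepsilon^2|L_t|^2\\
&\le 2t\int_0^t |b(X_s,\theta_0) - b(X_s^0,\theta_0)|^2\,ds + 2\varepsilon^2\sup_{0\le s\le t}|L_s|^2\\
&\le 2K^2 t\int_0^t |X_s - X_s^0|^2\,ds + 2\varepsilon^2\sup_{0\le s\le t}|L_s|^2.
\end{aligned}$$

By Gronwall's inequality, it follows that

$$|X_t - X_t^0|^2 \le 2\varepsilon^2 e^{2K^2t^2}\sup_{0\le s\le t}|L_s|^2$$

and consequently

$$\sup_{0\le t\le T}|X_t - X_t^0| \le \sqrt{2}\,\varepsilon\,e^{K^2T^2}\sup_{0\le t\le T}|L_t|, \tag{3.2}$$

which goes to zero in probability as $\varepsilon \to 0$ for each $T > 0$. Since $[nt]/n \to t$ as $n \to \infty$,
we conclude that the statement holds.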
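
The pathwise bound (3.2) can be checked numerically on a simulated path. The sketch below is only an illustration under several assumptions that are not in the paper: the drift is $b(x,\theta) = -\theta x$ (so the Lipschitz constant is $K = \theta_0$), the driving noise is a standard Brownian motion, $T = 1$, and both the SDE and the ODE are replaced by their Euler approximations on the same grid. The function name `deviation_and_bound` is hypothetical.

```python
import math
import random


def deviation_and_bound(theta0, x0, eps, n, rng):
    """Compare max_k |X_k - X0_k| with sqrt(2) * eps * exp(K^2) * max_k |B_k|.

    X is an Euler path of dX = -theta0 * X dt + eps dB on [0, 1], X0 is the
    Euler path of the ODE (eps = 0), and K = theta0 is the Lipschitz constant.
    """
    dt = 1.0 / n
    x, x_ode, noise = x0, x0, 0.0
    sup_dev, sup_noise = 0.0, 0.0
    for _ in range(n):
        d_noise = rng.gauss(0.0, math.sqrt(dt))
        x = x - theta0 * x * dt + eps * d_noise
        x_ode = x_ode - theta0 * x_ode * dt
        noise += d_noise
        sup_dev = max(sup_dev, abs(x - x_ode))
        sup_noise = max(sup_noise, abs(noise))
    bound = math.sqrt(2.0) * eps * math.exp(theta0 ** 2) * sup_noise
    return sup_dev, bound


if __name__ == "__main__":
    dev, bound = deviation_and_bound(1.0, 1.0, 0.05, 500, random.Random(3))
    print(dev, bound, dev <= bound)
```

Both quantities scale linearly in $\varepsilon$, which is the mechanism behind the uniform convergence of the observed path to the ODE solution.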



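Lemma 3.2 Let $\tau_m^{n,\varepsilon} = \inf\{t \ge 0 : |X_t^0| \ge m \text{ or } |Y_t^{n,\varepsilon}| \ge m\}$. Then, $\tau_m^{n,\varepsilon} \to \infty$ a.s.
uniformly in $n$ and $\varepsilon$ as $m \to \infty$.

Proof. Note that

$$X_t = x_0 + \int_0^t b(X_s,\theta_0)\,ds + \varepsilon L_t.$$

By the linear growth condition on $b$ and the Cauchy-Schwarz inequality, we get

$$\begin{aligned}
|X_t|^2 &\le 2(|x_0| + \varepsilon|L_t|)^2 + 2\Big|\int_0^t b(X_s,\theta_0)\,ds\Big|^2\\
&\le 2\Big(|x_0| + \varepsilon\sup_{0\le s\le t}|L_s|\Big)^2 + 2t\int_0^t |b(X_s,\theta_0)|^2\,ds\\
&\le 2\Big(|x_0| + \varepsilon\sup_{0\le s\le t}|L_s|\Big)^2 + 2K^2t\int_0^t (1+|X_s|)^2\,ds\\
&\le \Big[2\Big(|x_0| + \varepsilon\sup_{0\le s\le t}|L_s|\Big)^2 + 4K^2t^2\Big] + 4K^2t\int_0^t |X_s|^2\,ds.
\end{aligned}$$

Gronwall's inequality yields that

$$|X_t|^2 \le \Big[2\Big(|x_0| + \varepsilon\sup_{0\le s\le t}|L_s|\Big)^2 + 4K^2t^2\Big]e^{4K^2t^2}$$

and

$$|X_t| \le \Big[\sqrt{2}\Big(|x_0| + \varepsilon\sup_{0\le s\le t}|L_s|\Big) + 2Kt\Big]e^{2K^2t^2}.$$

Thus, since $\varepsilon \in (0,1]$, it follows that for every $n$ and $\varepsilon$

$$\sup_{0\le s\le t}|Y_s^{n,\varepsilon}| \le \sup_{0\le s\le t}|X_s| \le \Big[\sqrt{2}\Big(|x_0| + \sup_{0\le s\le t}|L_s|\Big) + 2Kt\Big]e^{2K^2t^2},$$

which is almost surely finite. Therefore the proof is complete. □

We shall use $\nabla_x f(x,\theta) = (\partial_{x_1}f(x,\theta), \dots, \partial_{x_d}f(x,\theta))^T$ to denote the gradient of
$f(x,\theta)$ with respect to $x$.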



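Lemma 3.3 Let $f \in C_\uparrow^{1,1}(\mathbb{R}^d\times\Theta;\mathbb{R})$. Assume (A1)-(A2). Then, we have

$$\frac{1}{n}\sum_{k=1}^n f(X_{t_{k-1}},\theta) \xrightarrow{P_{\theta_0}} \int_0^1 f(X_s^0,\theta)\,ds$$

as $\varepsilon \to 0$ and $n \to \infty$, uniformly in $\theta \in \Theta$.

Proof. By the differentiability of the function $f(x,\theta)$ and Lemma 3.1, we find that

$$\begin{aligned}
\sup_{\theta\in\Theta}\Big|\frac{1}{n}\sum_{k=1}^n f(X_{t_{k-1}},\theta) - \int_0^1 f(X_s^0,\theta)\,ds\Big|
&= \sup_{\theta\in\Theta}\Big|\int_0^1 f(Y_s^{n,\varepsilon},\theta)\,ds - \int_0^1 f(X_s^0,\theta)\,ds\Big|\\
&\le \sup_{\theta\in\Theta}\int_0^1 |f(Y_s^{n,\varepsilon},\theta) - f(X_s^0,\theta)|\,ds\\
&\le \sup_{\theta\in\Theta}\int_0^1\Big|\int_0^1 (\nabla_x f)^T\big(X_s^0 + u(Y_s^{n,\varepsilon} - X_s^0),\theta\big)\cdot(Y_s^{n,\varepsilon} - X_s^0)\,du\Big|\,ds\\
&\le \int_0^1\Big(\int_0^1 \sup_{\theta\in\Theta}\big|\nabla_x f\big(X_s^0 + u(Y_s^{n,\varepsilon} - X_s^0),\theta\big)\big|\,du\Big)|Y_s^{n,\varepsilon} - X_s^0|\,ds\\
&\le \int_0^1 C\big(1 + |X_s^0| + |Y_s^{n,\varepsilon}|\big)^\lambda\,|Y_s^{n,\varepsilon} - X_s^0|\,ds\\
&\le C\Big(1 + \sup_{0\le s\le 1}|X_s^0| + \sup_{0\le s\le 1}|X_s|\Big)^\lambda \sup_{0\le s\le 1}|Y_s^{n,\varepsilon} - X_s^0|\\
&\xrightarrow{P_{\theta_0}} 0
\end{aligned}$$

as $\varepsilon \to 0$ and $n \to \infty$. □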



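Lemma 3.4 Let $f \in C_\uparrow^{1,1}(\mathbb{R}^d\times\Theta;\mathbb{R})$. Assume (A1)-(A2). Then, we have that for each
$1 \le i \le d$ and each $\theta \in \Theta$,

$$\sum_{k=1}^n f(X_{t_{k-1}},\theta)\big(L_{t_k}^i - L_{t_{k-1}}^i\big) \xrightarrow{P_{\theta_0}} \int_0^1 f(X_s^0,\theta)\,dL_s^i$$

as $\varepsilon \to 0$ and $n \to \infty$, where

$$L_t^i = a_i t + \sum_{j=1}^r \sigma_{ij}B_t^j + \int_0^t\!\int_{|z|\le 1} z_i\,\tilde N(ds,dz) + \int_0^t\!\int_{|z|>1} z_i\,N(ds,dz)$$

is the $i$-th component of $L_t$.

Proof. Note that

$$\sum_{k=1}^n f(X_{t_{k-1}},\theta)\big(L_{t_k}^i - L_{t_{k-1}}^i\big) = \int_0^1 f(Y_s^{n,\varepsilon},\theta)\,dL_s^i.$$

Let $\tilde L_t^i = L_t^i - \int_0^t\!\int_{|z|>1} z_i\,N(ds,dz)$. Then, we have the following decomposition

$$\int_0^1 f(Y_s^{n,\varepsilon},\theta)\,dL_s^i - \int_0^1 f(X_s^0,\theta)\,dL_s^i
= \int_0^1\!\int_{|z|>1}\big(f(Y_s^{n,\varepsilon},\theta) - f(X_s^0,\theta)\big)z_i\,N(ds,dz)
+ \int_0^1 \big(f(Y_s^{n,\varepsilon},\theta) - f(X_s^0,\theta)\big)\,d\tilde L_s^i.$$

Similar to the proof of Lemma 3.3, we have

$$\begin{aligned}
\Big|\int_0^1\!\int_{|z|>1}\big(f(Y_s^{n,\varepsilon},\theta) - f(X_s^0,\theta)\big)z_i\,N(ds,dz)\Big|
&\le \int_0^1\!\int_{|z|>1}|f(Y_s^{n,\varepsilon},\theta) - f(X_s^0,\theta)|\,|z_i|\,N(ds,dz)\\
&\le \int_0^1\!\int_{|z|>1} C\big(1 + |X_s^0| + |Y_s^{n,\varepsilon}|\big)^\lambda|Y_s^{n,\varepsilon} - X_s^0|\,|z_i|\,N(ds,dz)\\
&\le C\Big(1 + \sup_{0\le s\le 1}|X_s^0| + \sup_{0\le s\le 1}|X_s|\Big)^\lambda \sup_{0\le s\le 1}|Y_s^{n,\varepsilon} - X_s^0| \int_0^1\!\int_{|z|>1}|z_i|\,N(ds,dz),
\end{aligned}$$

which converges to zero in probability as $\varepsilon \to 0$ and $n \to \infty$ by Lemma 3.1. By using the
stopping time $\tau_m^{n,\varepsilon}$, Lemma 3.1, the Markov inequality and dominated convergence, we find
that for any given $\eta > 0$ and some fixed $m$,

$$\begin{aligned}
&P\Big(\Big|\int_0^1 \big(f(Y_s^{n,\varepsilon},\theta) - f(X_s^0,\theta)\big)\mathbf{1}_{\{s\le\tau_m^{n,\varepsilon}\}}\,d\tilde L_s^i\Big| > \eta\Big)\\
&\quad\le \frac{|a_i|}{\eta}\int_0^1 E\big[|f(Y_s^{n,\varepsilon},\theta) - f(X_s^0,\theta)|\,\mathbf{1}_{\{s\le\tau_m^{n,\varepsilon}\}}\big]\,ds\\
&\qquad + \frac{1}{\eta}\Big(\sum_{j=1}^r \sigma_{ij}^2\int_0^1 E\big[|f(Y_s^{n,\varepsilon},\theta) - f(X_s^0,\theta)|^2\,\mathbf{1}_{\{s\le\tau_m^{n,\varepsilon}\}}\big]\,ds\Big)^{1/2}\\
&\qquad + \frac{1}{\eta}\Big(\int_0^1 E\big[|f(Y_s^{n,\varepsilon},\theta) - f(X_s^0,\theta)|^2\,\mathbf{1}_{\{s\le\tau_m^{n,\varepsilon}\}}\big]\,ds\cdot\int_{|z|\le 1}|z_i|^2\,\nu(dz)\Big)^{1/2},
\end{aligned}\tag{3.3}$$

which goes to zero as $\varepsilon \to 0$ and $n \to \infty$. Then, we have

$$P\Big(\Big|\int_0^1 \big(f(Y_s^{n,\varepsilon},\theta) - f(X_s^0,\theta)\big)\,d\tilde L_s^i\Big| > \eta\Big)
\le P(\tau_m^{n,\varepsilon} < 1) + P\Big(\Big|\int_0^1 \big(f(Y_s^{n,\varepsilon},\theta) - f(X_s^0,\theta)\big)\mathbf{1}_{\{s\le\tau_m^{n,\varepsilon}\}}\,d\tilde L_s^i\Big| > \eta\Big),$$

which converges to zero as $\varepsilon \to 0$ and $n \to \infty$ by Lemma 3.2 and (3.3). This completes
the proof. □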



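Lemma 3.5 Let $f \in C_\uparrow^{1,1}(\mathbb{R}^d\times\Theta;\mathbb{R})$. Assume (A1)-(A2). Then, we have that for
$1 \le i \le d$,

$$\sum_{k=1}^n f(X_{t_{k-1}},\theta)\big(X_{t_k}^i - X_{t_{k-1}}^i - b_i(X_{t_{k-1}},\theta_0)\Delta t_{k-1}\big) \xrightarrow{P_{\theta_0}} 0$$

as $\varepsilon \to 0$ and $n \to \infty$, uniformly in $\theta \in \Theta$, where $X_t^i$ and $b_i$ are the $i$-th components of
$X_t$ and $b$, respectively.

Proof. Note that

$$X_{t_k}^i = X_{t_{k-1}}^i + \int_{t_{k-1}}^{t_k} b_i(X_s,\theta_0)\,ds + \varepsilon\big(L_{t_k}^i - L_{t_{k-1}}^i\big).$$

It is easy to see that

$$\begin{aligned}
&\sum_{k=1}^n f(X_{t_{k-1}},\theta)\big(X_{t_k}^i - X_{t_{k-1}}^i - b_i(X_{t_{k-1}},\theta_0)\Delta t_{k-1}\big)\\
&\quad= \sum_{k=1}^n\int_{t_{k-1}}^{t_k} f(X_{t_{k-1}},\theta)\big(b_i(X_s,\theta_0) - b_i(X_{t_{k-1}},\theta_0)\big)\,ds + \varepsilon\sum_{k=1}^n f(X_{t_{k-1}},\theta)\big(L_{t_k}^i - L_{t_{k-1}}^i\big)\\
&\quad= \int_0^1 f(Y_s^{n,\varepsilon},\theta)\big(b_i(X_s,\theta_0) - b_i(Y_s^{n,\varepsilon},\theta_0)\big)\,ds + \varepsilon\int_0^1 f(Y_s^{n,\varepsilon},\theta)\,dL_s^i.
\end{aligned}$$

By the given condition on $f$ and the Lipschitz condition on $b$, we have

$$\begin{aligned}
\sup_{\theta\in\Theta}\Big|\int_0^1 f(Y_s^{n,\varepsilon},\theta)\big(b_i(X_s,\theta_0) - b_i(Y_s^{n,\varepsilon},\theta_0)\big)\,ds\Big|
&\le \int_0^1 \sup_{\theta\in\Theta}|f(Y_s^{n,\varepsilon},\theta)|\cdot K|X_s - Y_s^{n,\varepsilon}|\,ds\\
&\le KC\int_0^1 \big(1 + |Y_s^{n,\varepsilon}|\big)^\lambda\big(|X_s - X_s^0| + |Y_s^{n,\varepsilon} - X_s^0|\big)\,ds\\
&\le KC\Big(1 + \sup_{0\le t\le 1}|X_t|\Big)^\lambda\Big(\sup_{0\le s\le 1}|X_s - X_s^0| + \sup_{0\le s\le 1}|Y_s^{n,\varepsilon} - X_s^0|\Big),
\end{aligned}$$

which converges to zero in probability as $\varepsilon \to 0$ and $n \to \infty$ by Lemma 3.1. Next, using
the decomposition of $L_t^i$, we have

$$\begin{aligned}
\sup_{\theta\in\Theta}\Big|\varepsilon\int_0^1 f(Y_s^{n,\varepsilon},\theta)\,dL_s^i\Big|
&\le \varepsilon\sup_{\theta\in\Theta}\Big|a_i\int_0^1 f(Y_s^{n,\varepsilon},\theta)\,ds\Big|
+ \varepsilon\sup_{\theta\in\Theta}\Big|\int_0^1 f(Y_s^{n,\varepsilon},\theta)\sum_{j=1}^r\sigma_{ij}\,dB_s^j\Big|\\
&\quad+ \varepsilon\sup_{\theta\in\Theta}\Big|\int_0^1\!\int_{|z|\le 1} f(Y_s^{n,\varepsilon},\theta)z_i\,\tilde N(ds,dz)\Big|
+ \varepsilon\sup_{\theta\in\Theta}\Big|\int_0^1\!\int_{|z|>1} f(Y_s^{n,\varepsilon},\theta)z_i\,N(ds,dz)\Big|.
\end{aligned}$$

It is clear that

$$\varepsilon\sup_{\theta\in\Theta}\Big|a_i\int_0^1 f(Y_s^{n,\varepsilon},\theta)\,ds\Big|
\le \varepsilon|a_i|C\int_0^1 \big(1 + |Y_s^{n,\varepsilon}|\big)^\lambda\,ds
\le \varepsilon|a_i|C\Big(1 + \sup_{0\le s\le 1}|X_s|\Big)^\lambda,$$

which converges to zero in probability as $\varepsilon \to 0$ and $n \to \infty$, and

$$\begin{aligned}
\varepsilon\sup_{\theta\in\Theta}\Big|\int_0^1\!\int_{|z|>1} f(Y_s^{n,\varepsilon},\theta)z_i\,N(ds,dz)\Big|
&\le \varepsilon\int_0^1\!\int_{|z|>1}\sup_{\theta\in\Theta}|f(Y_s^{n,\varepsilon},\theta)|\cdot|z_i|\,N(ds,dz)\\
&\le \varepsilon\int_0^1\!\int_{|z|>1} C\big(1 + |Y_s^{n,\varepsilon}|\big)^\lambda\cdot|z_i|\,N(ds,dz)\\
&\le \varepsilon C\Big(1 + \sup_{0\le s\le 1}|X_s|\Big)^\lambda\int_0^1\!\int_{|z|>1}|z_i|\,N(ds,dz),
\end{aligned}$$

which converges to zero in probability. Note that

$$P\Big(\varepsilon\sup_{\theta\in\Theta}\Big|\int_0^1 f(Y_s^{n,\varepsilon},\theta)\sum_{j=1}^r\sigma_{ij}\,dB_s^j\Big| > \eta\Big)
\le P(\tau_m^{n,\varepsilon} < 1) + P\Big(\varepsilon\sup_{\theta\in\Theta}\Big|\int_0^1 f(Y_s^{n,\varepsilon},\theta)\mathbf{1}_{\{s\le\tau_m^{n,\varepsilon}\}}\sum_{j=1}^r\sigma_{ij}\,dB_s^j\Big| > \eta\Big). \tag{3.4}$$

Let

$$u_{n,\varepsilon}^i(\theta) = \varepsilon\int_0^1 f(Y_s^{n,\varepsilon},\theta)\mathbf{1}_{\{s\le\tau_m^{n,\varepsilon}\}}\sum_{j=1}^r\sigma_{ij}\,dB_s^j.$$

We want to prove that $u_{n,\varepsilon}^i(\theta) \to 0$ in probability as $\varepsilon \to 0$ and $n \to \infty$, uniformly in
$\theta \in \Theta$. It suffices to show the pointwise convergence and the tightness of the sequence
$\{u_{n,\varepsilon}^i(\cdot)\}$. For the pointwise convergence, by the Chebyshev inequality and Itô's isometry,
we have

$$\begin{aligned}
P\big(|u_{n,\varepsilon}^i(\theta)| > \eta\big)
&\le \varepsilon^2\eta^{-2}\sum_{j=1}^r\sigma_{ij}^2\int_0^1 E\big[|f(Y_s^{n,\varepsilon},\theta)|^2\,\mathbf{1}_{\{s\le\tau_m^{n,\varepsilon}\}}\big]\,ds\\
&\le \varepsilon^2\eta^{-2}\sum_{j=1}^r\sigma_{ij}^2\int_0^1 E\big[C^2\big(1 + |Y_s^{n,\varepsilon}|\big)^{2\lambda}\,\mathbf{1}_{\{s\le\tau_m^{n,\varepsilon}\}}\big]\,ds\\
&\le \varepsilon^2\eta^{-2}\sum_{j=1}^r\sigma_{ij}^2\,C^2(1+m)^{2\lambda},
\end{aligned}\tag{3.5}$$

which converges to zero as $\varepsilon \to 0$ and $n \to \infty$ with fixed $m$. For the tightness of $\{u_{n,\varepsilon}^i(\cdot)\}$,
by using Theorem 20 in Appendix I of Ibragimov and Has'minskii [9], it is enough to
prove the following two inequalities

$$E\big[|u_{n,\varepsilon}^i(\theta)|^{2q}\big] \le C, \tag{3.6}$$

$$E\big[|u_{n,\varepsilon}^i(\theta_2) - u_{n,\varepsilon}^i(\theta_1)|^{2q}\big] \le C|\theta_2 - \theta_1|^{2q} \tag{3.7}$$

for $\theta, \theta_1, \theta_2 \in \Theta$, where $2q > p$. The proof of (3.6) is very similar to the moment estimates in
(3.5), replacing Itô's isometry with the Burkholder-Davis-Gundy inequality, so we omit
the details here. For (3.7), by using Taylor's formula and the Burkholder-Davis-Gundy
inequality, we have

$$\begin{aligned}
E\big[|u_{n,\varepsilon}^i(\theta_2) - u_{n,\varepsilon}^i(\theta_1)|^{2q}\big]
&\le \varepsilon^{2q}C_q\Big(\sum_{j=1}^r\sigma_{ij}^2\Big)^q E\Big[\Big(\int_0^1 \big(f(Y_s^{n,\varepsilon},\theta_2) - f(Y_s^{n,\varepsilon},\theta_1)\big)^2\,\mathbf{1}_{\{s\le\tau_m^{n,\varepsilon}\}}\,ds\Big)^q\Big]\\
&\le \varepsilon^{2q}C_q\Big(\sum_{j=1}^r\sigma_{ij}^2\Big)^q E\Big[\Big(\int_0^1\!\int_0^1 |\theta_2 - \theta_1|^2\,\big|\nabla_\theta f\big(Y_s^{n,\varepsilon},\theta_1 + v(\theta_2 - \theta_1)\big)\big|^2\,\mathbf{1}_{\{s\le\tau_m^{n,\varepsilon}\}}\,dv\,ds\Big)^q\Big]\\
&\le C|\theta_2 - \theta_1|^{2q}.
\end{aligned}$$

Combining (3.4) and the above arguments, we have that $\varepsilon\sup_{\theta\in\Theta}\big|\int_0^1 f(Y_s^{n,\varepsilon},\theta)\sum_{j=1}^r\sigma_{ij}\,dB_s^j\big|$
converges to zero in probability as $\varepsilon \to 0$ and $n \to \infty$. Similarly, we can prove that
$\varepsilon\sup_{\theta\in\Theta}\big|\int_0^1\!\int_{|z|\le 1} f(Y_s^{n,\varepsilon},\theta)z_i\,\tilde N(ds,dz)\big|$ converges to zero in probability as $\varepsilon \to 0$ and
$n \to \infty$. Therefore, the proof is complete. □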
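Now we are in a position to prove Theorem 2.1.
Proof of Theorem 2.1. Note that

$$\begin{aligned}
\Phi_{n,\varepsilon}(\theta) &= -2\sum_{k=1}^n \big(b(X_{t_{k-1}},\theta) - b(X_{t_{k-1}},\theta_0)\big)^T\big(X_{t_k} - X_{t_{k-1}} - n^{-1}b(X_{t_{k-1}},\theta_0)\big)\\
&\quad+ \frac{1}{n}\sum_{k=1}^n \big|b(X_{t_{k-1}},\theta) - b(X_{t_{k-1}},\theta_0)\big|^2\\
&=: \Phi_{n,\varepsilon}^{(1)}(\theta) + \Phi_{n,\varepsilon}^{(2)}(\theta).
\end{aligned}$$

By Lemma 3.5 with $f(x,\theta) = b_i(x,\theta) - b_i(x,\theta_0)$ ($1 \le i \le d$), we have $\sup_{\theta\in\Theta}|\Phi_{n,\varepsilon}^{(1)}(\theta)| \xrightarrow{P_{\theta_0}} 0$
as $\varepsilon \to 0$ and $n \to \infty$. By using Lemma 3.3 with $f(x,\theta) = |b(x,\theta) - b(x,\theta_0)|^2$, we find
$\sup_{\theta\in\Theta}|\Phi_{n,\varepsilon}^{(2)}(\theta) - F(\theta)| \xrightarrow{P_{\theta_0}} 0$ as $\varepsilon \to 0$ and $n \to \infty$, where $F(\theta) = \int_0^1 |b(X_t^0,\theta) - b(X_t^0,\theta_0)|^2\,dt$.
Thus, combining the previous arguments, we have

$$\sup_{\theta\in\Theta}|\Phi_{n,\varepsilon}(\theta) - F(\theta)| \xrightarrow{P_{\theta_0}} 0$$

as $\varepsilon \to 0$ and $n \to \infty$, and (A3) together with the continuity of $X^0$ yields that

$$\inf_{|\theta - \theta_0| > \delta} F(\theta) > F(\theta_0) = 0$$

for each $\delta > 0$. Therefore, by Theorem 5.9 of van der Vaart [H], we have the desired
consistency, i.e., $\hat\theta_{n,\varepsilon} \xrightarrow{P_{\theta_0}} \theta_0$ as $\varepsilon \to 0$ and $n \to \infty$. This completes the proof. □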

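3.2 Proof of Theorem 2.2

Note that

$$\nabla_\theta\Phi_{n,\varepsilon}(\theta) = -2\sum_{k=1}^n (\nabla_\theta b)^T(X_{t_{k-1}},\theta)\big(X_{t_k} - X_{t_{k-1}} - b(X_{t_{k-1}},\theta)\Delta t_{k-1}\big).$$

Let $G_{n,\varepsilon}(\theta) = (G_{n,\varepsilon}^1, \dots, G_{n,\varepsilon}^p)^T$ with

$$G_{n,\varepsilon}^i(\theta) = \sum_{k=1}^n (\partial_{\theta_i}b)^T(X_{t_{k-1}},\theta)\big(X_{t_k} - X_{t_{k-1}} - b(X_{t_{k-1}},\theta)\Delta t_{k-1}\big), \quad i = 1, \dots, p,$$

and let $K_{n,\varepsilon}(\theta) = \nabla_\theta G_{n,\varepsilon}(\theta)$, which is a $p \times p$ matrix consisting of elements
$K_{n,\varepsilon}^{ij}(\theta) = \partial_{\theta_j}G_{n,\varepsilon}^i(\theta)$, $1 \le i, j \le p$. Moreover, we introduce the following function

$$K^{ij}(\theta) = \int_0^1 (\partial_{\theta_j}\partial_{\theta_i}b)^T(X_s^0,\theta)\big(b(X_s^0,\theta_0) - b(X_s^0,\theta)\big)\,ds - I^{ij}(\theta), \quad 1 \le i, j \le p.$$

Then we define the matrix function $K(\theta) = (K^{ij}(\theta))_{1\le i,j\le p}$.

Before proving Theorem 2.2, we prepare some preliminary results.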

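Lemma 3.6 Assume (A1)-(A2). Then, we have that for each $i = 1, \dots, p$,

$$\varepsilon^{-1}G_{n,\varepsilon}^i(\theta_0) \xrightarrow{P_{\theta_0}} \int_0^1 (\partial_{\theta_i}b)^T(X_s^0,\theta_0)\,dL_s$$

as $\varepsilon \to 0$, $n \to \infty$ and $n\varepsilon \to \infty$.

Proof. Note that for $1 \le i \le p$,

$$\begin{aligned}
\varepsilon^{-1}G_{n,\varepsilon}^i(\theta_0)
&= \varepsilon^{-1}\sum_{k=1}^n (\partial_{\theta_i}b)^T(X_{t_{k-1}},\theta_0)\big(X_{t_k} - X_{t_{k-1}} - b(X_{t_{k-1}},\theta_0)\Delta t_{k-1}\big)\\
&= \varepsilon^{-1}\sum_{k=1}^n (\partial_{\theta_i}b)^T(X_{t_{k-1}},\theta_0)\int_{t_{k-1}}^{t_k}\big(b(X_s,\theta_0) - b(X_{t_{k-1}},\theta_0)\big)\,ds\\
&\quad+ \sum_{k=1}^n (\partial_{\theta_i}b)^T(X_{t_{k-1}},\theta_0)\big(L_{t_k} - L_{t_{k-1}}\big)\\
&=: H_{n,\varepsilon}^{(1)}(\theta_0) + H_{n,\varepsilon}^{(2)}(\theta_0).
\end{aligned}$$

By using Lemma 3.4 with $f(x,\theta) = \partial_{\theta_i}b_j(x,\theta)$ ($1 \le i \le p$, $1 \le j \le d$) and $\theta = \theta_0$,
we have

$$H_{n,\varepsilon}^{(2)}(\theta_0) = \int_0^1 (\partial_{\theta_i}b)^T(Y_s^{n,\varepsilon},\theta_0)\,dL_s \xrightarrow{P_{\theta_0}} \int_0^1 (\partial_{\theta_i}b)^T(X_s^0,\theta_0)\,dL_s$$

as $\varepsilon \to 0$ and $n \to \infty$. It suffices to prove that $H_{n,\varepsilon}^{(1)}(\theta_0)$ converges to zero in probability.
For $H_{n,\varepsilon}^{(1)}(\theta_0)$, we need a delicate estimate for the process $X_t$. For $s \in [t_{k-1}, t_k]$, we
have

$$X_s - X_{t_{k-1}} = \int_{t_{k-1}}^s \big(b(X_u,\theta_0) - b(X_{t_{k-1}},\theta_0)\big)\,du + b(X_{t_{k-1}},\theta_0)(s - t_{k-1}) + \varepsilon(L_s - L_{t_{k-1}}).$$

By the Lipschitz condition on $b$ and the Cauchy-Schwarz inequality, we find that

$$\begin{aligned}
|X_s - X_{t_{k-1}}|^2
&\le 2\Big|\int_{t_{k-1}}^s \big(b(X_u,\theta_0) - b(X_{t_{k-1}},\theta_0)\big)\,du\Big|^2 + 2\big(|b(X_{t_{k-1}},\theta_0)|(s - t_{k-1}) + \varepsilon|L_s - L_{t_{k-1}}|\big)^2\\
&\le 2K^2n^{-1}\int_{t_{k-1}}^s |X_u - X_{t_{k-1}}|^2\,du + 2\Big(n^{-1}|b(X_{t_{k-1}},\theta_0)| + \varepsilon\sup_{t_{k-1}\le s\le t_k}|L_s - L_{t_{k-1}}|\Big)^2.
\end{aligned}$$

By Gronwall's inequality, we get

$$|X_s - X_{t_{k-1}}|^2 \le 2\Big(n^{-1}|b(X_{t_{k-1}},\theta_0)| + \varepsilon\sup_{t_{k-1}\le s\le t_k}|L_s - L_{t_{k-1}}|\Big)^2 e^{2K^2n^{-1}(s - t_{k-1})}.$$

It further follows that

$$\sup_{t_{k-1}\le s\le t_k}|X_s - X_{t_{k-1}}| \le \sqrt{2}\Big(n^{-1}|b(X_{t_{k-1}},\theta_0)| + \varepsilon\sup_{t_{k-1}\le s\le t_k}|L_s - L_{t_{k-1}}|\Big)e^{K^2/n^2}. \tag{3.8}$$

Thus, by the Lipschitz condition on $b$ and (3.8), we get

$$\begin{aligned}
|H_{n,\varepsilon}^{(1)}(\theta_0)|
&\le \varepsilon^{-1}\sum_{k=1}^n |\partial_{\theta_i}b(X_{t_{k-1}},\theta_0)|\cdot\Big|\int_{t_{k-1}}^{t_k}\big(b(X_s,\theta_0) - b(X_{t_{k-1}},\theta_0)\big)\,ds\Big|\\
&\le (n\varepsilon)^{-1}K\sum_{k=1}^n |\partial_{\theta_i}b(X_{t_{k-1}},\theta_0)|\sup_{t_{k-1}\le s\le t_k}|X_s - X_{t_{k-1}}|\\
&\le \frac{\sqrt{2}Ke^{K^2/n^2}}{n\varepsilon}\cdot\frac{1}{n}\sum_{k=1}^n |\partial_{\theta_i}b(X_{t_{k-1}},\theta_0)|\cdot|b(X_{t_{k-1}},\theta_0)|\\
&\quad+ \sqrt{2}Ke^{K^2/n^2}\cdot\frac{1}{n}\sum_{k=1}^n |\partial_{\theta_i}b(X_{t_{k-1}},\theta_0)|\sup_{t_{k-1}\le s\le t_k}|L_s - L_{t_{k-1}}|\\
&=: H_{n,\varepsilon}^{(1,1)}(\theta_0) + H_{n,\varepsilon}^{(1,2)}(\theta_0).
\end{aligned}$$

It is easy to see that $H_{n,\varepsilon}^{(1,1)}(\theta_0)$ converges to zero in probability as $n\varepsilon \to \infty$ since

$$\frac{1}{n}\sum_{k=1}^n |\partial_{\theta_i}b(X_{t_{k-1}},\theta_0)|\cdot|b(X_{t_{k-1}},\theta_0)| \le CK\Big(1 + \sup_{0\le s\le 1}|X_s|\Big)^{\lambda+1} < \infty \quad\text{a.s.}$$

(cf. (3.2)). By using the basic fact that

$$\frac{1}{n}\sum_{k=1}^n \sup_{t_{k-1}\le s\le t_k}|L_s - L_{t_{k-1}}| = o_P(1),$$

we find that

$$H_{n,\varepsilon}^{(1,2)}(\theta_0) \le \sqrt{2}Ke^{K^2/n^2}C\Big(1 + \sup_{0\le s\le 1}|X_s|\Big)^\lambda\cdot\frac{1}{n}\sum_{k=1}^n \sup_{t_{k-1}\le s\le t_k}|L_s - L_{t_{k-1}}|,$$

which converges to zero in probability as $\varepsilon \to 0$ and $n \to \infty$. Therefore the proof is
complete. □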

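Lemma 3.7 Assume (A1)-(A4). Then, we have

$$\sup_{\theta\in\Theta}|K_{n,\varepsilon}(\theta) - K(\theta)| \xrightarrow{P_{\theta_0}} 0$$

as $\varepsilon \to 0$ and $n \to \infty$.

Proof. It suffices to prove that for $1 \le i, j \le p$,

$$\sup_{\theta\in\Theta}|K_{n,\varepsilon}^{ij}(\theta) - K^{ij}(\theta)| \xrightarrow{P_{\theta_0}} 0$$

as $\varepsilon \to 0$ and $n \to \infty$. Note that

$$\begin{aligned}
K_{n,\varepsilon}^{ij}(\theta)
&= \sum_{k=1}^n (\partial_{\theta_j}\partial_{\theta_i}b)^T(X_{t_{k-1}},\theta)\big(X_{t_k} - X_{t_{k-1}} - b(X_{t_{k-1}},\theta_0)\Delta t_{k-1}\big)\\
&\quad+ \frac{1}{n}\sum_{k=1}^n \Big[(\partial_{\theta_j}\partial_{\theta_i}b)^T(X_{t_{k-1}},\theta)\big(b(X_{t_{k-1}},\theta_0) - b(X_{t_{k-1}},\theta)\big) - (\partial_{\theta_i}b)^T(X_{t_{k-1}},\theta)\,\partial_{\theta_j}b(X_{t_{k-1}},\theta)\Big]\\
&=: K_{n,\varepsilon}^{ij,(1)}(\theta) + K_{n,\varepsilon}^{ij,(2)}(\theta).
\end{aligned}$$

By using Lemma 3.5 with $f(x,\theta) = \partial_{\theta_j}\partial_{\theta_i}b_l(x,\theta)$ ($1 \le i, j \le p$, $1 \le l \le d$), we have
that $\sup_{\theta\in\Theta}|K_{n,\varepsilon}^{ij,(1)}(\theta)|$ converges to zero in probability as $\varepsilon \to 0$ and $n \to \infty$. By using
Lemma 3.3 with $f(x,\theta) = (\partial_{\theta_j}\partial_{\theta_i}b)^T(x,\theta)\big(b(x,\theta_0) - b(x,\theta)\big) - (\partial_{\theta_i}b)^T(x,\theta)\,\partial_{\theta_j}b(x,\theta)$,
it follows that $\sup_{\theta\in\Theta}|K_{n,\varepsilon}^{ij,(2)}(\theta) - K^{ij}(\theta)|$ converges to zero in probability as $\varepsilon \to 0$ and
$n \to \infty$. Thus, the proof is complete. □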

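Finally we are ready to prove Theorem 2.2.

Proof of Theorem 2.2. The proof ideas mainly follow Uchida [39]. Let $B(\theta_0;\rho) = \{\theta : |\theta - \theta_0| \le \rho\}$
for $\rho > 0$. Then, by the consistency of $\hat\theta_{n,\varepsilon}$, there exists a sequence $\eta_{n,\varepsilon} \to 0$
as $\varepsilon \to 0$ and $n \to \infty$ such that $B(\theta_0;\eta_{n,\varepsilon}) \subset \Theta_0$, and that $P_{\theta_0}[\hat\theta_{n,\varepsilon} \in B(\theta_0;\eta_{n,\varepsilon})] \to 1$.
When $\hat\theta_{n,\varepsilon} \in B(\theta_0;\eta_{n,\varepsilon})$, it follows by Taylor's formula that

$$D_{n,\varepsilon}S_{n,\varepsilon} = \varepsilon^{-1}G_{n,\varepsilon}(\hat\theta_{n,\varepsilon}) - \varepsilon^{-1}G_{n,\varepsilon}(\theta_0),$$

where $D_{n,\varepsilon} = \int_0^1 K_{n,\varepsilon}(\theta_0 + u(\hat\theta_{n,\varepsilon} - \theta_0))\,du$ and $S_{n,\varepsilon} = \varepsilon^{-1}(\hat\theta_{n,\varepsilon} - \theta_0)$, since $B(\theta_0;\eta_{n,\varepsilon})$ is a
convex subset of $\Theta_0$. We have

$$\begin{aligned}
|D_{n,\varepsilon} - K_{n,\varepsilon}(\theta_0)|\,\mathbf{1}_{\{\hat\theta_{n,\varepsilon}\in B(\theta_0;\eta_{n,\varepsilon})\}}
&\le \sup_{\theta\in B(\theta_0;\eta_{n,\varepsilon})}|K_{n,\varepsilon}(\theta) - K_{n,\varepsilon}(\theta_0)|\\
&\le \sup_{\theta\in B(\theta_0;\eta_{n,\varepsilon})}|K_{n,\varepsilon}(\theta) - K(\theta)|
+ \sup_{\theta\in B(\theta_0;\eta_{n,\varepsilon})}|K(\theta) - K(\theta_0)| + |K_{n,\varepsilon}(\theta_0) - K(\theta_0)|.
\end{aligned}$$

Consequently, it follows from Lemma 3.7 that

$$D_{n,\varepsilon} \xrightarrow{P_{\theta_0}} K(\theta_0), \quad \varepsilon \to 0,\ n \to \infty.$$

Note that $K(\theta)$ is continuous with respect to $\theta$. Since $-K(\theta_0) = I(\theta_0)$ is positive definite,
there exists a positive constant $\delta > 0$ such that $\inf_{|w|=1}|K(\theta_0)w| > 2\delta$. For such a
$\delta > 0$, there exist $\varepsilon(\delta) > 0$ and $N(\delta) \in \mathbb{N}$ such that for any $\varepsilon \in (0, \varepsilon(\delta))$ and $n > N(\delta)$,
$B(\theta_0;\eta_{n,\varepsilon}) \subset \Theta_0$ and $|K(\theta) - K(\theta_0)| < \delta/2$ for $\theta \in B(\theta_0;\eta_{n,\varepsilon})$. For such $\delta > 0$, let

$$\Gamma_{n,\varepsilon} = \Big\{\sup_{|\theta - \theta_0| < \eta_{n,\varepsilon}}|K_{n,\varepsilon}(\theta) - K(\theta)| < \frac{\delta}{2},\ \hat\theta_{n,\varepsilon} \in B(\theta_0;\eta_{n,\varepsilon})\Big\}.$$

Then, for any $\varepsilon \in (0, \varepsilon(\delta))$ and $n > N(\delta)$, we have, on $\Gamma_{n,\varepsilon}$,

$$\begin{aligned}
\sup_{|w|=1}|(D_{n,\varepsilon} - K(\theta_0))w|
&\le \sup_{|w|=1}\Big|\Big(D_{n,\varepsilon} - \int_0^1 K(\theta_0 + u(\hat\theta_{n,\varepsilon} - \theta_0))\,du\Big)w\Big|\\
&\quad+ \sup_{|w|=1}\Big|\Big(\int_0^1 K(\theta_0 + u(\hat\theta_{n,\varepsilon} - \theta_0))\,du - K(\theta_0)\Big)w\Big|\\
&\le \sup_{|\theta - \theta_0| < \eta_{n,\varepsilon}}|K_{n,\varepsilon}(\theta) - K(\theta)| + \frac{\delta}{2} < \delta.
\end{aligned}$$

Thus, on $\Gamma_{n,\varepsilon}$,

$$\inf_{|w|=1}|D_{n,\varepsilon}w| \ge \inf_{|w|=1}|K(\theta_0)w| - \sup_{|w|=1}|(D_{n,\varepsilon} - K(\theta_0))w| > 2\delta - \delta = \delta > 0.$$

Hence, letting

$$\mathcal{D}_{n,\varepsilon} = \{D_{n,\varepsilon} \text{ is invertible},\ \hat\theta_{n,\varepsilon} \in B(\theta_0;\eta_{n,\varepsilon})\},$$

we see that $P_{\theta_0}[\mathcal{D}_{n,\varepsilon}] \ge P_{\theta_0}[\Gamma_{n,\varepsilon}] \to 1$ as $\varepsilon \to 0$ and $n \to \infty$ by Lemma 3.7. Now set

$$U_{n,\varepsilon} = D_{n,\varepsilon}\mathbf{1}_{\mathcal{D}_{n,\varepsilon}} + I_{p\times p}\mathbf{1}_{\mathcal{D}_{n,\varepsilon}^c},$$

where $I_{p\times p}$ is the identity matrix. Then it is easy to see that

$$|U_{n,\varepsilon} - K(\theta_0)| \le |D_{n,\varepsilon} - K(\theta_0)|\mathbf{1}_{\mathcal{D}_{n,\varepsilon}} + |I_{p\times p} - K(\theta_0)|\mathbf{1}_{\mathcal{D}_{n,\varepsilon}^c} \xrightarrow{P_{\theta_0}} 0,$$

since $P_{\theta_0}[\mathcal{D}_{n,\varepsilon}] \to 1$. Thus, by Lemma 3.6, we obtain that

$$\begin{aligned}
S_{n,\varepsilon} &= U_{n,\varepsilon}^{-1}D_{n,\varepsilon}S_{n,\varepsilon}\mathbf{1}_{\mathcal{D}_{n,\varepsilon}} + S_{n,\varepsilon}\mathbf{1}_{\mathcal{D}_{n,\varepsilon}^c}\\
&= U_{n,\varepsilon}^{-1}\big(-\varepsilon^{-1}G_{n,\varepsilon}(\theta_0)\big)\mathbf{1}_{\mathcal{D}_{n,\varepsilon}} + S_{n,\varepsilon}\mathbf{1}_{\mathcal{D}_{n,\varepsilon}^c}\\
&\xrightarrow{P_{\theta_0}} (I(\theta_0))^{-1}\Big(\int_0^1 (\partial_{\theta_1}b)^T(X_s^0,\theta_0)\,dL_s, \dots, \int_0^1 (\partial_{\theta_p}b)^T(X_s^0,\theta_0)\,dL_s\Big)^T
\end{aligned}$$

as $\varepsilon \to 0$, $n \to \infty$ and $n\varepsilon \to \infty$. This completes the proof. □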